Secure Provenance for Data Preservation Repositories
نویسنده
چکیده
Importance of research data preservation and management has been accepted by the scientists all around the world. Interest and investment in data preservation projects has become higher than ever before. Already there are number of wellknown research data repositories for different types of research data. Data preservation, sharing, discovery and reuse are the key features which are common across all such repositories. Data provenance is used to track lineage or processing history of a particular data product. Capturing provenance has been identified as an important step in any scientific application. Therefore, data preservation repositories are also utilizing provenance practices mainly to enhance data discovery. However, in some situations, the complete provenance information about datasets cannot be published in preservation repositories due to various possible reasons. Therefore, such repositories should facilitate mechanisms to control the amount of provenance information exposed for outside people. In this paper, we identify the scenarios in which the conflicts between obfuscation and disclosure of provenance exists in the context of data preservation repositories. We propose a secure provenance model which is capable of preserving provenance integrity while satisfying obfuscation requirements. We build our design based on SEAD [1] repository. Keywords—Secure provenance, Provenance integrity, Data preservation, Policy.
منابع مشابه
Authenticity and Provenance in Long Term Digital Preservation: Modeling and Implementation in Preservation Aware Storage
A growing amount of digital objects is designated for long term preservation a time scale during which technologies, formats and communities are very likely to change. Specialized approaches, models and technologies are needed to guarantee the long-term understandability of the preserved data. Maintaining the authenticity (trustworthiness) and provenance (history of creation, ownership, accesse...
متن کاملDevelopments in Digital Preservation at the University of Illinois: The Hub and Spoke Architecture for Supporting Repository Interoperability and Emerging Preservation Standards
Funded by the National Digital Information Infrastructure and Preservation Program (NDIIPP), the ECHO DEPository Project supports the digital preservation efforts of the Library of Congress by contributing research and software to help society GET, SAVE, and KEEP its digital cultural heritage. Project activities include building Web archiving tools, evaluating existing repository software, deve...
متن کاملQuerying Provenance Information in Distributed Environments
The growing recognition of the importance of provenance for data intensive and multidisciplinary domains is leading to careful collection of provenance. One consequence of this is the proliferation of provenance repositories hosted for individual organization or communities, with limited ability to reconstruct and query for and on provenance across them. Community standards like the Open Proven...
متن کاملFormal Hash Compression Provenance Techniques for the Preservation of the Virtual Machine Log Auditor Environment
In this paper we provide tamper proof mechanisms for auditing old log entries as a part of lineage provenance within the virtual machine (VM) environment. For each VM provenance log record we apply SHA1 hash checksums, all encapsulated as huffman compressed codes to enforce log preservation against tampering. Our contribution establishes new formal definitions for the VM log provenance. Additio...
متن کاملSPROV 2.0: A Highly-Configurable Platform-Independent Library for Secure Provenance
Data provenance allows us to explore the lineage and derivation history of data objects. As data and its provenance flow between people and tasks in potentially untrusted environments, it becomes essential to provide integrity and confidentiality assurances for provenance. Any solution also needs to be efficient, modular, and easy to deploy. In this poster and demonstration proposal, we discuss...
متن کامل